TUSQ: Targeted High-Utility Sequence Querying

نویسندگان

چکیده

Significant efforts have been expended in the research and development of a database management system (DBMS) that has wide range applications for managing an enormous collection multisource, heterogeneous, complex, or growing data. Besides primary function (i.e., create, delete, update), practical impeccable DBMS can interact with users through information selection, is, querying their targets. Previous algorithms, such as frequent itemset sequential pattern (SPQ) focused on measurement frequency, which does not involve concept utility, is helpful to discover more informative patterns. To apply technology wider applications, we incorporate utility into target-oriented SPQ formulate task targeted utility-oriented sequence querying. address proposed problem, develop novel algorithm, namely high-utility (TUSQ), based two upper bounds (suffix remain terminated descendants utility) well vertical last instance table. For further efficiency, TUSQ relies projection utilizing compact data structure called chain. An extensive experimental study conducted several real synthetic datasets shows algorithm outperformed designed baseline terms runtime, memory consumption, candidate filtering.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Declarative Querying for Biological Sequence Databases

The ongoing revolution in life sciences research is producing vast amounts of genetic and proteomic sequence data. Scientists want to pose increasingly complex queries on this data, but current methods for querying biological sequences are primitive and largely procedural. This limits the ease with which complex queries can be posed, and often results in very inefficient query plans. There is a...

متن کامل

ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data

We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTML pages (the so-called structured Web). It illustrates a two-phase querying of the Web, in which an intentional description of the targeted data is first provided, in a flexible and widely applicable manner. ObjectRunn...

متن کامل

A New Algorithm for High Average-utility Itemset Mining

High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...

متن کامل

A High Performance XML Querying Architecture

Data exchange on the Internet plays an essential role in electronic business (e-business). A recent trend in e-business is to create distributed databases to facilitate data exchange. In most cases, the distributed databases are developed by integrating existing systems, which may be in different database models, and on different hardware and/or software platforms. Heterogeneity may cause many ...

متن کامل

Semantic High Level Querying in Sensor Networks

The quick development and deployment of sensor technology within the general frame of the Internet of Things poses relevant opportunity and challenges. The sensor is not a pure data source, but an entity (Semantic Sensor Web) with associated metadata and it is a building block of a “worldwide distributed” real time database, to be processed through real-time queries. Important challenges are to...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Big Data

سال: 2023

ISSN: ['2372-2096', '2332-7790']

DOI: https://doi.org/10.1109/tbdata.2022.3175428